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Abstract —In this work, we formulate and study a data 
dissemination problem, which can be viewed as a generalization 
of the index coding problem and of the data exchange problem 
to networks with an arbitrary topology. We define r-solvable 
networks, in which data dissemination can be achieved in r > 0 
communications rounds. We show that the optimum number 
of transmissions for any one-round communications scheme is 
given by the minimum rank of a certain constrained family 
of matrices. For a special case of this problem, called bipartite 
data dissemination problem, we present lower and upper graph- 
theoretic bounds on the optimum number of transmissions. For 
general r-solvable networks, we derive an upper bound on the 
minimum number of transmissions in any scheme with > r 
rounds. We experimentally compare the obtained upper bound 
to a simple lower bound. 

Index Terms —Data dissemination, data exchange, index cod¬ 
ing. 

I. Introduction 

A problem of index coding with side information considers 
a communications scenario with one broadcast transmitter 
and several receivers. All receivers possess some partial 
information available to the transmitter and request additional 
information. The goal is to design a communications scheme, 
which minimizes the total number of transmissions. 

Index coding problem was proposed hrst in 0 : it was 
suggested therein to use coding in order to minimize a 
number of transmissions. Later, in l|3l, the minimum number 
of transmissions in the index coding problem was shown to 
be equal to the minimum rank of a properly dehned family 
of matrices. Generally, computing the minimum rank of a 
family of the matrices is an NP-hard problem, yet in some 
special cases there exist efficient algorithms to compute it El, 
0 . 

Index coding problem was intensively studied in the recent 
years, see for example fl], 0 , Jb), Q, |[T 8 ], 121 ]. It was 
shown in a, mni that index coding problem is equivalent 
to a network coding problem 0 , (HI]. In index coding, 
however, the underlying network graph is very simple, it is a 
directed star graph, where the transmitter is the root of that 
graph. 

A variation of index coding, termed data exchange prob¬ 
lem, was studied in im. In the data exchange problem, unlike 
the index coding problem, every node can serve as both a 
transmitter and a receiver. The underlying network graph 
is a complete directed graph. Before the communications 
take place, each node possesses some partial information. 
The goal is to deliver all information to all the nodes in 
a minimum number of transmissions. It was shown in ED 


that the minimum number of transmissions in the data ex¬ 
change problem can also be described as a rank minimization 
problem of a certain constrained family of matrices, thus 
resembling some of the results for index coding. 

Another related problem is a set reconciliation E2, El, 
II20I . II22I . IID, 124 ]. The set reconciliation problem is 
usually dehned over a network of arbitrary topology, either 
wired or wireless. In that problem, similarly to the data 
exchange problem, the goal is to deliver all information to all 
the nodes. However, by contrast, it is assumed that no node 
knows what information is possessed by the other nodes. This 
makes the set reconciliation problem more difficult than the 
data exchange problem. 

In this work, we introduce a data dissemination problem, 
which further generalizes both the index coding and the 
data exchange problems, such that the underlying directed 
connectivity graph of the network is an arbitrary graph. This 
model, in particular, represents cached networks of arbitrary 
topology. The data dissemination problem can also be viewed 
as a generalization of the set reconciliation problem. In data 
dissemination problem, every node can serve as both a trans¬ 
mitter and a receiver. Moreover, each node possesses some 
partial information and requests some additional information. 

Example I.l. Consider an example network in Figure [7] 
There are five nodes Vi, V2, V3, U4 and V5, which in total 
possess three bits of information xi, X2, x^. If Vi transmits 
Xi -b X2 and V2 transmits X2 + X3, then the requests of all 
nodes will be satisfied with only two transmissions. 

Vl V2 

has xi, X2 has X2, X3 



requests X2, X3 requests xi, X3 requests X2 

Fig. 1. Example network 

In this work, we present the following results. First, 
we formulate and study a data dissemination problem. We 
dehne r-solvable networks, in which data dissemination can 




be achieved in r > 0 communications rounds. We show 
that the optimal number of transmissions for any one-round 
communications scheme is given by the minimum rank 
of a certain constrained family of matrices. For a special 
case of this problem, termed bipartite data dissemination 
problem, we present lower and upper graph-theoretic bounds 
on the optimum number of transmissions. For general r- 
solvable networks, by using similar techniques, we derive 
an upper bound on the minimum number of transmissions in 
> r rounds. We experimentally compare the obtained upper 
bound to a simple lower bound. 

II. Notation 

Denote [n] = {1,2,-•• ,n} (in particular, [0] denotes the 
empty set). We use 0 to denote the all-zero vector, when the 
length of the vector is clear from the context. Similarly, we 
use ei to denote the unit vector which has 1 in position i and 
zeros everywhere else. We assume hereafter that all vectors 
are column vectors. 

Let F be a finite field F^, where g is a prime power. Take 
A to be a matrix over F. Denote by the i-th row of 
A and by {A)ij the entry in the i-th row and j-th column 
of A. We use the notation rowspace(A) to denote the row 
space of the matrix A, and notation A® B for the standard 
tensor product of the matrices A and B. For the row vector 
V = {vi,V 2 , ■ ■ ■ , Vn), we denote by diag(t;) the nxn matrix 
as follows: 

(diag(^)).. = { 0 other^iL ' 

Fix an ambient vector space L C F". Let FF be a subspace 
of V. The orthogonal vector space of W is given by 

W'^ = {v G L I Vtu €W : n • in = 0} , 

where v ■ w denotes the inner product of the two vectors. 

Let U,W C V he two vector subspaces. Define 

U + W = {u + w\ uGU and tu G FF} C L . 

If U nFF = {0}, then we also write t/0FF instead of U -l-FF. 

Let Qiy, £) be a directed graph with the vertex set V and 
the edge set £. For each f G V, introduce the notations 

= {u G V : {v,£) e £} 

and Mout{£) = {v €V : [£,v) € £} . 

Let E be the all-one square matrix. The size of E will 
be apparent from the context. Similarly, let I be the identity 
matrix. Finally, denote by 1„ the all-one column vector of 
length n. 

III. Problem setup 

First, we define the data dissemination problem. Consider 
a wireless network with a topology given by a finite directed 
connected graph 0{V,£), where V = [fc] is the set of nodes 
and £ is the set of edges. Let x = (xi,... x„) G F" be an 
information vector. Each node i G V possesses some side 


information consisting of the symbols Xj, j G Vi C [n], and 
is interested in receiving all of the symbols Xi, i G Ti C 

[nm. 

In the data dissemination problem, the goal is to find a 
coded transmission protocol with the minimum number of 
transmissions, such that all nodes could recover all their 
respective requested symbols. However, unlike in ifTTl . the 
network might not have full connectivity. 

Throughout this paper we make the following assumptions. 

• The graph Q is an arbitrary directed graph. 

• All transmissions are broadcast, i.e. the messages trans¬ 
mitted by the node i are always received by all the nodes 
in J^outi£)• 

• The coding is linear, i.e. each node i G V transmits 

messages of the form where G F for 

all i G [n]. 

• There is a central entity that knows Q and all the sets Vi 
and Ti for all i gV. This entity is running an algorithm 
for finding an optimal communications scheme. 

• The transmissions are without errors and interference, 
i.e. all transmissions are received correctly. This can 
be achieved by separately handling error-correction and 
interference in the lower layers. 

• There are no parallel edges in the network graph. This 
assumption simplifies the analysis, yet similar analysis 
can be done if the graph has parallel edges. 

Hereafter, we also assume that the transmissions are per¬ 
formed in rounds. During a round, each node transmits linear 
combinations of the symbols it possesses, but it can not use 
any symbol received in the same round. 

In Figure |2] we formally define Protocol P. Let r be an 
integer parameter, which denotes the number of rounds in the 
protocol. For all ^ G V and for all * = 0,1, 2, • • • , r, denote 
by Q!f^ the set of symbols in F possessed by the node (. at 
the end of round i (here, denotes the set 

of symbols possessed by the node I before the execution of 
the protocol). 


For i = 1 to 7’ do { 

Transmitting phase: each node £ G V broad¬ 
casts Ti^f, linear combinations = 

d'x,i,£,j ' X, J = 1, 2, • • • , Ti^l- 

^ (i) 

Receiving phase: each node £ G V updates ’ = 

U ,Ti „ • 

} 

Recovery phase: each node £ G V computes Xj = 
Px,£,j ■ X for all j G Ti. 


Fig. 2. Protocol P. 

Here, the integers Ti^i denote the number of symbols in 
F transmitted by the node £ G V in round i G [r]. The 
coefficients Px,i,£,j G F are chosen to multiply the symbols 
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X € in the j-th linear combination, j € ^ €V, 

i G [r], and similarly Hx/j G F are chosen to multiply the 
symbols x £ Q\ ’ in the j-th linear combination, j £%. 

We note that both the index coding problem HI and the 
data exchange problem m can be viewed as special cases 
of Protocol P, for r = 1. 

Definition III.l. Consider a network based on the graph 
Q{V,S). The assignment of the sets Vi and % is called 
feasible if for any j G %, i G V, there exists a node £ € V 
with j G Vi, such that there is a finite directed path from (. 
to i in Q. 

It readily follows from Definition IIII. 1 1 that for any feasible 
assignment there exists a selection of the integers Ti^i and of 
the coefficients p,x,i,i,j and p,x/j in Protocol P, such that 
all requests are satisfied in a finite number of rounds (for 
example, by simply forwarding the requested bits in Q, so 
all the coefficients are either 0 or 1). 


Define the family A of (kn) x n block matrices as: 

r Ai 1 



Afc 


( 2 ) 


Given A € A, the j-th nxn sub-matrix of A will be denoted 
as Aj. We will also use the notation to denote the 

dn X n matrix 




4 

■^12 


L A.. 


where = {*i, * 2 i''' i and d is an in-degree of i 

in g. 

For each £ S V, we define an n x n information matrix 

Pi = {Pi)i^[n]je[n], 


Definition III.2. The network based on the graph g(V, S) 
is said to be r-solvable, r € N, it is strongly connected 
and for any feasible assignment of the sets Vi and 71, i G 
V, r communications rounds are sufficient for the protocol 
to satisfy all the node requests, but r — 1 rounds are not 
sufficient. If the network is not r-solvable for any r S N, 
then we say that it is not solvable. 

Lemma III.l. The network is r-solvable for some r G^ if 
the maximum of the shortest length of the directed path from 
the node i to the node £ in Q, for any two nodes GV, is 
exactly r. 

The proof of this lemma appears in the appendix. 


/p X ^ r 1 if i = j and i G Vi 
Q otherwise 

Similarly, for each £ G V, we define an n x n query matrix 

Ti = (Tf)iG[n].ieN. 

1 if i = j and i G 71 
0 otherwise 

In a, it was shown that if the network is a star graph, 
then the min-rank of the side information graph yields the 
optimal number of transmissions. We extend this result to all 
networks where the transmitters and receivers may possess 
and request arbitrary information bits such that it is possible 
to perform the transmissions in one round. 


We define the transposed k x k integer adjacency matrix 
D of the graph g{V,£) as follows: 




1 if {j, i) g£ 
0 otherwise 


Corollary III.2. The network is r-solvable if r is the 
smallest integer such that all the entries in the matrix 
are strictly positive. 


IV. Optimal solution for one round data 

EXCHANGE PROBLEM 

Let the graph C/(V,F), the information vector x = 
{xi,... Xn) G F", and the sets Vf, and 7^ for £ G V be 
defined as above. We represent a matrix family A over F as 
a matrix over F U {A’}, where is a special symbol. The 
entry, which can take any value from F in A is marked as 
‘V. 

For each node £ G V, we define the family A^ of n x n 
matrices as follows. 

^ I 0 otherwise 


Theorem IV.l. Consider a wireless network defined by 
the graph g(y,£). Let A be an nk x n matrix family 
defined as above. For all nodes £ G V, let Pi and Ti 
be the corresponding possession and query matrices. Then, 
the minimal number of transmissions in Protocol P needed 
to satisfy the demands of all nodes in V in one round of 
communications is 


T = mm 
AeA 


rank {Ai) \ , 


(3) 




where for all i GV 


rowspace 



3 rowspace(r^) . 


(4) 


If the above matrix A G A as above does not exist then there 
is no algorithm that satisfies all requests in one round. 

The proof of this theorem appears in the appendix. 


It is straightforward to see that when (7 is a complete 
directed graph, then the result of Theorem IIV. 1 1 is equivalent 
to what is shown for the data exchange problem in uni 
Section V]. When £/ is a directed star graph, then this result is 
equivalent to the min-rank optimization for the index coding 
problem as in a. 
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V. Graph-theoretic bounds 

In this section, we consider a restricted special case of the 
data dissemination problem, termed bipartite data dissemina¬ 
tion problem, where the underlying network graph Q{V,£) 
is a bipartite graph, with V = AUB, AnB = 0, where 
the set ,4 is a set of transmitters, the set B is the set of 
receivers, and £ C A x B. Additionally, assume that the 
information vector is x = (xi,... Xn) S F", and \B\ = n. 
For all £ G ,4 we have Vi = [n] and Ti = 0, i.e. the nodes 
in A serve as transmitters only, and they all have all Xi’s. We 
assume w.l.o.g. that B = [n], and furthermore, for the sake 
of simplicity, we also assume that for each £ G B, Ti = {£}. 
Observe that the case, when |yl| = 1 and there exists an 
edge (i,£) G £ for i G A and all £ G B, corresponds to the 
well-known index coding problem a. 

Next, we recall some known facts from the literature. 

Definition V.l (lEl). For an index coding instance as 
above, a side information graph is the directed graph FL = 
{V'n,£'n) with the vertex set Vu = ond the edge set 
£-h = {(*, j) I 3 G V^}. 

If the side information graph is symmetric, i.e. 

Vi,j G [n] : {i,j) G £n ^ {j,i) G £n , 

then we can view it as the corresponding undirected graph. 

Definition V.2 (ifThlU. Let FI be a graph with a vertex set 
Vu = [n]. We say that a n x n-dimensional 0—1 matrix 
M fits FL if for all i G [n] we have = 1, and for all 

i,j G [n], {M)ij = 0 if there is no edge (i,j) in Fi. 

The minimum rank of graph Q is defined as 

min-rank 2 [Ff) = nm {rank2(A4)| matrix M fits FL} , 

where rank2(A4) denotes the rank of the binary matrix M 
over F 2 . 

Theorem V.l (H). For any undirected graph FL, 

a {FL) < 0 {FL) < min-rank 2 {FL) < cc {FL) , 

where a {FL) is the size of the largest independent set in FL, 
0 {FL) is the Shannon capacity ofFL 4761/ . and cc {FL) is the 
minimum click cover size of the graph FL. 

Theorem V.2 (El). Consider an instance of index coding 
problem over F 2 represented by a side information graph FL. 
The minimum number of transmissions in any solution for 
this instance using linear code is given by min-rank 2 {FL). 

Take an instance of a bipartite data dissemination problem. 
We define a side information graph FL for that problem 
analogous to Definition lV.il namely a side information graph 
FL = {V-fi^Bn) is a graph with the vertex set V-h = [n] and 
the edge set £-h = {{i,j) \ j G Vi}. 

Below, we derive new graph-theoretic upper and lower 
bounds on the optimal number of transmissions for the 
bipartite data dissemination problem. We start with the lower 
bound, given by the following lemma. 


Proposition V.3. The optimal number of transmissions in 
Protocol P for a bipartite data dissemination problem is at 
least min-rank 2 {FL), which is in turn bounded from below 
by 0 {FL) and by a {FL). 

Proof: We convert the given instance of a bipartite data 
dissemination problem into an instance of index coding, by 
replacing all the transmitters in A by one super-transmitter, 
and by adding edges from this super-transmitter to all the 
nodes in B. The optimal transmission scheme for the original 
data dissemination problem is also a valid transmission 
scheme for the corresponding index coding problem. The 
side information graph for both instances is exactly the same. 
The optimal number of transmissions in the later scheme is 
bounded from below by min-rank 2 {FL) due to Theorem lV.2l 
and the claim follows. ■ 

Next, consider an instance of a bipartite data dissemination 
problem with the underlying network graph Q{V,£), where 
V = .4U6, .4nS = 0, and £ 0 A x B. Assume that 
1^1 = t and let Sp = { 01 , 02 ,-■■ , 0 t}, 0 ^ = {V„T), 
be a set of graphs induced by the partition V = Uig[t]Vi, 
all Vi are disjoint, such that for all i G [/] there is 
|-4nVi| = 1. Since there are no parallel edges, we obtain that 
I {(u, b)\{v, b) G £i} I = 1 for all i G [f] and all 6 G 6 fl Vi. 

Lemma V.4. For all Oi as above, i G [t], consider an 
induced instance of index coding problem with a transmitter 
in .4n Vi and the set of the receivers BClVi. For each £ G Vi, 
the sets Vi and Ti are defined exactly as in the original 
problem. Denote by FLi, i G [f], the corresponding side infor¬ 
mation graph. Then, the optimum number of transmissions in 
Protocol P for the given bipartite data dissemination problem 
is less or equal to 

t 

min-rank 2 {FLi) ■ 

i=l 

Proof: Let Q{V,£) be an underlying network graph of 
a bipartite data dissemination problem, and let be as de¬ 
fined above. We construct a new bipartite data dissemination 
problem, as follows. The network graph G{V, £) for the new 
problem is given by 

B = UigpjFi C £ . 

For each £ G V, the sets Vi and Ti are defined exactly as in 
the original problem. 

The optimum number of transmissions for the original 
problem is less or equal to the number of transmissions 
in any solution for the new problem, because the optimum 
solution for the new problem is a valid solution to the 
original problem. Since there are no edges connecting nodes 
in different graphs Gi, the optimum solution to the new 
problem is obtained as a combination of optimum solutions 
to each of the index coding problems induced by the graphs 
Gi, i G [£]. Therefore, the optimum number of solutions to 
the new problem is given by 

t 

min-rank 2 {FLj) , 

i=l 
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and it serves as an upper bound on the optimum number of 
transmissions for the original problem. ■ 

Denote by P the set of all graph partitions = 
{Gi,G 2 ,--- ,Gt}, Gt = such that V = all 

Vi are disjoint, |./ln Vi| = 1 for all i G [f], and £i is a set of 
edges induced by Vi in G- For a partition let be a 

side information graph of the index coding problem induced 
by the vertex set Vi in G- By minimizing over all such Cp G P, 
and by applying Theorem lV.il we obtain the following result. 

Corollary V.5. The optimum number of transmissions for 
the given bipartite data dissemination problem is less or 
equal to 


Example VI.l. For a fixed t gV, let 


'k 

0 


k 

0 

k: 

0 


k 

0 

k 

0 


k 

0 

k 

0 


k 

0 

k 

0 

k 

k 

0 


After replacing the symbols we obtain 


T{K) 


1 0 0 0 0 
0 0 10 0 
0 0 0 1 0 
0 0 0 0 0 
0 0 0 0 0 


t t 

miny^ min-rank2 (TfifT?)) < minY^ cc('Hi(‘B)) 

i—1 2=1 

= mince , 

tpgp 

where (*P) is the graph obtained by the union of the graphs 

nfiv), i € M- 

VI. Data exchange protocol extension to many 

ROUNDS 

In this section, we consider a more general scenario. Here, 
the underlying network graph I/(V, £) is an arbitrary directed 
graph. For each node i G V, we require that Vi U % = [n]. 

In the Section IIVI only a single round of communications 
was required. If the underlying network graph is an arbitrary 
directed graph, then several rounds of communications may 
be needed. The goal is to minimize the total number of 
communicated bits while the number of rounds is bounded. 

Proposition VI.l. For a node i G V, and for i G [n], 
denote by di(xi) the length of the shortest path from a 
set of vertices having Xi in their possession to i. Let 

= Y^iaTt 

dmax = maxd|. (5) 

fev 

Then, the minimum number of transmissions in any protocol 
for data dissemination problem is at least dmax- 

The proof of this proposition appears in the appendix. 

Let the matrix families A^, for i gV, he as defined in ([T]), 
and A be as defined in (|2l). 

Definition VI.l. The maximum rank of the matrix family 
A is defined as 


Algebra of matrix families: Denote by F* the alphabet 
FU{‘*’}. In what follows, we represent families of matrices 
over F as matrices over F*. In the sequel, we define oper¬ 
ations on the matrices over F*, in a way which allows to 
describe algebraically the data dissemination in the network. 
In particular, we define two operations, the addition ‘+’ and 
the multiplication ‘-’of two elements a, 6 G F*, in such way 
that if a, 6 G F, then these operations coincide with usual 
addition and multiplication in the field F. 

Addition and multiplication of two elements, where at least 
one of the elements is are given in the following tables. 

Addition table; 


+ 

b 

k 

a 

a + b 

k 

k 

k 

k 


Multiplication table: 


• 

0 

5^0 

k 

0 

0 

0 

0 

a 0 

0 

a ■ b 

k 

k 

0 

k 

k 


(7) 


where a and b are any two elements in F. 

The addition and multiplication operations over F* can be 
naturally extended to operations on matrices over F*. 


Example VI.2. Let a 3 x 3 matrix B over the field F and 
a 3 X 3 matrix family A over F be given by 


'i 

1 

o' 



k 

0 

o' 

1 

1 

1 

and 

A = 

0 

0 

k 

0 

1 

1 



_o 

k 

o_ 


By multiplying the matrix family A from the left by the matrix 
B, we obtain: 


max-rank(A) = maxrank(A.). 

Given the matrix family A^, we define an operator r(-), 
which replaces the symbols in the maximal number of the 
first rows with linearly independent canonical vectors, and 
replaces the symbols in the remaining rows with zeros. 

Similarly, operator r^(-), ^ G V, takes as an input the 
possession matrix from A and returns r^(A) = r(A^). 


o 

1—1 

1—1 


O 

o 


bt 0 ^ 

1 1 1 


k 

o 

o 

= 

’A' 

0 1 1 


0x0 


0 bkr Ar 


We also define the multiplication of integer matrices by 
the matrix families over F. 

Remark VI.l. In what follows, we use a binary operation 
of matrix multiplication, where one of the arguments is 
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an integer matrix and the second argument is a family of 
matrices over F, and the result is a family of matrices over F. 
In order to be able to do so, by slightly abusing the notation, 
we use the product of an integer matrix with a matrix over 
F*, according to the rules defined in @ and (O. The result of 
this operation is a matrix over F*, which can be interpreted 
as a family of matrices over F. 


Assume that there is an edge (i,£) S £ for some i € V, 
and that the node i has Xj. Then, ^ f- 0 and cnj = In 
that case, we obtain 0^ ^ = This correctly represents the 
situation that the node i delivers Xj to the node 1 . 

We conclude that the matrix {D ■ A) (g) 1„ correctly 
represents the possession matrix of the graph Q after one 
round of execution of Protocol P. ■ 


Example VI.3. Let a 3 x 3 integer matrix B be 


B = 


1 2 
4 5 
0 7 


0 

6 

8 


and A be a 3 X 3 matrix family over F as in Example WL2\ 
Multiplying B by A yields 


B-A = 


1 2 0 
4 5 6 
0 7 8 


*00 
0 0 * 
0*0 


* 0 * 

'k 'k ir 

0 k k 


A. The role of adjacency matrix 

Lemma VI.2. Let A be the possession matrix as defined 
in Equation (O. Let D be the adjacency matrix of the graph 
Q. Let E be an n X n identity matrix. It is possible to choose 
integers Ti/ and coefficients Px,i,e,j <^nd Px,e,j in Protocol 
P, such that after performing one round of the protocol, the 
new possession matrix A+ is related to A as 


A+ = {D®E)- A. 

Proof: From the definition of A in Equation (|2]i, the 
matrix families A^, * € V, have n identical rows. Then, we 
can write Ai = a[^^ (g 1„, where a[^^ is a row vector over 
F* of length n which consists of a single row of Ai. From 
the definition of the tensor product, we have A = A ® 1„, 
where 



Lemma IVI.2I can be naturally extended to protocols with 
several communications rounds. In the sequel, we denote by 
A(®\ i S N, the possession matrix after the i-th round of 
the protocol. For the sake of convenience, we also use the 
notation A^^^ = A. 

Corollary VI.3. It is possible to choose integers Ti^e 
and coefficients Px,i,e,j ond Px,e,j in Protocol P, such that 
the possession matrix after the i-th round of the protocol 
execution is given by 

aW = (D* (8>£;) • a(°). 


B. Data dissemination using rank optimization 

The following theorem is the main result of this section. 


Theorem VI.4. Let Q be an underlying directed graph of 
an resolvable network defined by the adjacency matrix D^. 
Let A be the corresponding possession matrix of the network. 
Then there exists, for a suitable choice of Ti^g,, Px,i,e,j nnd 
Protocol P with r rounds, for any r > rg, and r 
transmissions, where 


= E 


mm 
A(Oe(r>^-i(g)£;)-. 




(9) 


. iev 


for matrices which are subject to 


Vj € V : rank 


(diag (j 


®l'\ ■ Ad) 


V,{{D^®E) 

= max-rank ((diag(ej) (g I) ■ ^ E) ■ A) , (10) 


The right hand side of the claim can be re-written as 

(D g f;) • A = (D (g e;) ■ (A g) 1„) 

(D ■ A) g (f; • 1„) 

= (D • A) g (nl„) 

(F).A)gl„. 

Here, the transition (*) is due to the properties of the tensor 
product, and the transition (**) is due to Remark rVI.il 
Next, assume that 

A = , D = (df.jEgy E)-A = (dej'j ^gy . 

jeH jev 

By using tables in (|6ll and (I?]), for all i €V,j G [n], we 
have 

^ ^ • Ciij. (8) 

lev 


where the matrices I and E are both n x n. 

The proof of this theorem appears in the appendix. 

We note that Theorem IVI.4I is an existence result, and 
therefore it gives an upper bound on the optimal number 
of transmissions. It does not claim any optimality, though. In 
general, it is possible that the number of transmissions given 
by (|9]l is not optimal. 

VII. Experimental results 

In this section, we describe experimental study of the 
tightness of the bound in Theorem IVI.4I The instance of 
the problem consists of two main ingredients: the adjacency 
matrix of the graph and the possession matrix of the network. 
We generate the adjacency matrix of the graph randomly, 
while fixing the number of vertices in the graph and the 
diameter. We also generate randomly the possession matrix 
of the network. 
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Range 

[1,1.2) 

[1.2,1.4) 

[1.4,1.6) 

[1.6,1.8) 

[1.8, 2.0) 

[2.0, oo) 

Occurrence, % 

54 

22 

6 

4 

0 

14 


Fig. 3. The efficiency of the algorithm for graphs of diameter 2 


Range 

[1,1.2) 

[1.2,1.4) 

[1.4,1.6) 

[1.6,1.8) 

[1.8,2.0) 

[2.0, oo) 

Occurrence, % 

30 

18 

24 

0 

6 

22 


Fig. 4. The efficiency of the algorithm for graphs of diameter 3 


In general, enumeration of all the matrices in a matrix 
family has exponential complexity. In order to facilitate this 
process, we use a randomized algorithm. It picks random 
matrices from a given matrix family, and then checks if that 
matrix satishes the conditions of the theorem. We use two 
different types of networks: in the hrst case the diameter of 
the graph Q was two, and in the second case it was three. In 
both cases, the number of nodes was 4 and the number of 
information bits was 4. 

For each randomly chosen network, we compute the 
number of transmissions guaranteed by Theorem IVI.4I over 
the binary held and the lower bound on the number of 
transmissions in Proposition IVI. 1 1 We compute the ratio of 
these two quantities. The tables in Figures[3]and|4]present the 
distribution of this ratio. In order to compute the maximum 
rank of a matrix family, we use the algorithm in iflTll (see 
also ini). 
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IX. Appendix 
Proof: (Lemma I///. 71) 

I) Consider a protocol, where in each round, each node 
broadcasts all the symbols Xj that it has in its 
possession (including the messages that it received 
in the previous rounds). Pick some £ G V. Let 
{vo,vi,V 2 , ■ ■ ■ ,vt = £) be a shortest path from vq to 
£ of length t < r. Then, after i rounds the node Vi 
obtains all the symbols Xj that vq has in its possession. 
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Therefore, after r rounds, i has all the messages 
possessed by all the nodes in V. 

2) Next, we show that r — 1 rounds are not sufficient. 
Let uq and I be two vertices, such that the shortest 
path between them is of length r. Denote this path 
(uq, Ui,U 2 , • ■ • = ()■ Then, the shortest path be¬ 

tween uq and Vi is i for all i G [r]. Assume that 
VvQ = [n] and for any i G V\{uo}, Vi = 0. Assume 
that n > 1 and Tt = {!}• Then, clearly, after one 
iteration V 2 can not know Xi (because the shortest 
path from vq to V 2 is of length two, and the same 
symbol is not retransmitted within the same round.) 
More generally, for the same reason, for any i G [r — 1], 
after i iterations Vi+i can not know xi. 

■ 

Before we prove Theorem IIV. II we formulate and prove 
the following two lemmas. 

Lemma IX.l. ( 01/ ) Let V be an ambient space and W C 
V be a linear subspace. If there is a vector x G V such that 
X W, then there exists y G W'^ such that x ■ y f 0. 

Proof: Let a; G L be such that x ^ W. By contrary, 
assume that for all y G W-^- we have x ■ y = 0. Then, 
X G (VL■*■)■*■ = W. This is in contradiction to the conditions 
of the lemma. ■ 

Lemma IX.l. Let V be an ambient space and W C V 
be its linear subspace. If x £ W'^, then for every subspace 
W C W and for every vector y G W it holds x ■ y ~ Q. 

Proof: Let x G W'^. Pick any y G W. Then y G W. 
We obtain that x ■ y = Q. ■ 

Proof: (Theorem \IV.1\ 

The statement of the theorem is proven in two steps. 

1) We construct an exact coding scheme, which uses r 
symbol transmissions over F. We show that under this 
scheme, for all £ G V, the node i can recover the bit 
x^ for all p G Tf. 

Let A G A be the matrix, which minimizes the value 
of T in (|2l. Assume that Equation (Hji holds, and take 
some p G 7i, f G V. Then, 

^ ^ , 

ieWn(^).jeH je[n] 

where all aij and 13j are in F. Then, 

x.q — tc 

j&bT,] jG[n] 

Each (sending) node i G V will use some basis Bi 
of the rowspace of Ai, and will transmit the messages 
{b-x) G F for all b G Bi. It is straightforward to verify 
that in such way each node transmits combinations of 
bits that it has in its possession. The total number of 
messages that the node i transmits is rank(Ai) and the 


total number of messages transmitted in the scheme is 
rank(Ai) . 

zGV 


Each (receiving) node £ G V will be able to compute 
the values ■ x) for all i G Afin{£), j S VA-, from 
the messages (b-x). It will also be able to compute 
the values ■ x for all j € [n]. Therefore, the node 
£ will be able to compute Xrj, as required. 

2) We show that if there exists another linear code which 
satisfies the requests of all the nodes in Q, then it 
is possible to construct a corresponding matrix A as 
in Equation (IHi, which satisfies the conditions of the 
theorem. 

Consider the transmission scheme with the optimal 
number of transmissions Topt. Assume that for each 
^ G V, the node £ transmits messages of the form 
• at, J G [ne], 0 <ne <n. Here = '^opt is 

the total number of transmissions. 

Next, we show that for all £ G V and for all 77 G [n], if 
77 G 7r then the vector G F" belongs to Wr C F”, 
where We is the linear span of the vector set 



Eix some £ G V. By contrary, assume that ^ We. 
Then, by Lemma HX. 1 1 there exists x G W^ such that 

■ X 0. 

From the definition of We and Lemma IIX.21 we have 
that X ■ = 0 for all j G Nin{£), * G [nf, and 

a: • = 0 for all j GVi. This means that: 

(i) 

(i) the transmitted messages x ■ sy are 0 for every 
transmitter j G ffin{£), i C [nj]', 

(ii) the side information symbols Xi, i &Ve, available 
to the node £ are all 0. 


Thus, the node £ cannot distinguish between the in¬ 
formation vector X and the zero vector 0. However, 
Xrf f- 0. Therefore, our assumption that Sjj ^ W is 
false. We conclude that G We. 

Next, we construct the nxn matrices Ae for all £ G V. 
For that sake, we take 



if 7 G [ne] 

0 otherwise 


We obtain that for each £ G V, rank(A^) < ne, and 
therefore 

X]rank(A^) < '^opt ■ 


By construction, the resulting A belongs to the family 
A, and therefore the corresponding code satisfies equa¬ 
tion ©. We conclude that t in expression (O is indeed 
the minimum number of transmissions. 


Proof: (Proposition IV/.il ) 














The proof of this proposition is straightforward: let £ € V 
be the node that maximizes the expression ©. Then, at least 
d^ transmissions are needed in order to satisfy all the requests 
of ■ 

Proof: (Corollary \VI.3\j 

We have: 

= (£) 0 Ef ■ 

= (D^ E^) ■ 

= {D^ 0 n^-^E) ■ A(°) 

= 0^;) • A(°) 

= (£)* 0 E) ■ A(°) 

Here, the transition (§) holds due to Remark IVl. II Thus, any 
non-zero integer entry in ®E) is mapped to the element 
1 € F, and, therefore, the factor can be omitted. ■ 

Before we turn to proving Theorem IVI.4I we formulate 
and prove the following lemma. 

Lemma IX.3. Let Q be a directed graph defined by the 
adjacency matrix . Let the possession matrix family of 
the graph Q be A as defined in Equation (O- There exists a 
transmission matrix A G A such that 


’ 

diag 

( 77 M) 

07 ' 

A ■ 



r,(A) 



= max-rank ((diag(ej) 0 I) ■ [D ® E) ■ A) (11) 
for all j £ V. 


Proof: We analyze the left and the right-hand side of 
equation (QOll separately. 

1) The right-hand side of equation (fTol i can be written as 

(diag(ej) 0 J) • (D 0 ■ A 

= (diag(ej) 0/) • (D 0 L;) • (A 0 1„) 

= (diag(ej ) • D • A) 0 (7 • • 1„) 

= (diag(ej ) • D • A) 0 (nl„) 

*■= (diag(ej) ■ D ■ A) 0 1„ . 

The equality (^) holds because n > 0 and all non-zero 
integers are mapped to field element 1, thus we can 
omit the factor n. 

By employing the notation in (IVI-Al i. the equation ([Sll 
holds. Then, the )-th row of the matrix diag(ej) ■ D-A 
is {S 3 ,v)r,G[n]- We have: 


diag(ej) ■ D ■ A = 


0 

0 

0 

0 

Siev 


0 

0 

0 

0 


Next, it is straightforward to verify that the max-rank 
of the matrix family (diag(ej) • 7T ■ A) 0 1„ is the 
number of the symbols in the non-zero row of the 
matrix in Equation (fT2l i. 

2) Consider the upper-block part of the matrix in the left- 
hand side of ([Tol l. Denote 


^ — {0-i,j)iGlkn] ■ 

jeH 

In the sequel, we show that the values of the elements 
dij in A can be chosen such that the equation (fT^ 
holds. 

The matrix diag(T)t'ii) 07 is the diagonal block matrix, 
namely. 



r i>.M 

0„ 


On 1 

diag(D[^I)07 = 

On 



On 






L On 

On 


. 


where for all i £ V, Dyi is a diagonal n x n matrix 
as follows: 




dj^i 0 
0 dj^i 


0 

0 


0 0 • • • dj^i 


and 0„ is an n X n all-zero matrix. 

Then, (diag(7)f'^^) 0 7) • A = 

dj,i ■ di,i • • • dj^i ■ di^n 


dj,i 




dj.i 


^n,n 


dj^k ' d(^k — l)n+l,l ' * ■ dj^k ' ^(fc—l)n+l,n 
dj^k ' ^knp ' ' ’ ' dkn,n 

(13) 

The element in the j-th row and £-th column in the 
matrix in Equation (fT2l i is if there exists i such 
that dj^i f 0 and di^i = In that case, we can 
pick s £ [n] and set ^ to 1. We obtain that 

l)n-t-s4 ^ O- 

Since s £ [n], different s can be chosen for every £ £ 
[n]. After setting to 1 in every column £ £ 

[n], we set the values of all other elements in A to 0. 
Because the ones in the matrix in Equation ( fTSl ) are 
all in the distinct rows and in the distinct columns, the 
rank of the matrix (diag(73f'l]) 0 7) • A equals to the 
max-rank of the family (diag(ej) • 7) ■ A) 0 
If dj/ = then for s £ [n], 

and only these elements are set to 1 in A. Therefore, 
A G A. 
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From the construction of ^4 G A, we have that 
^diag j (g) • A has a single one in some row, 

for every column where there is in (diag(ej) ® J) • 
(£> ^ E) ■ A. The transposed adjacency matrix D has 
ones in the main diagonal. Therefore, if there exists a 
column with in (diag(ej) • Z) • A) (g 1„, then there 
also exists in the same column of A. 

Thus, if there is a single one in a row in any column 
of ^diag g / j A, then there is a single one 

in a row in the corresponding column of rj(A). As 

rank ^^diag j g 

= max-rank ((diag(ej) g J) • (£) g • A) , 

then 


rowspace (rj(A)) 

C rowspace ^^diag j g ■ A^ , 
and condition (fTTT i holds. 


The proof of the last lemma showed that the transmission 
matrix A exists. However, it may not be optimal. We next 
turn to proving Theorem IVI.4I 


Proof: (Theorem \VI.4\l 

From Lemma I IX. 31 there exist matrices satisfy¬ 

ing ([Toll. Take any such matrices, and write them as 


^(0 



p G [kn] 

V 6 [n] 


For all s G V, let the vectors 


f(i) = ( +0) +0) 

^ S V I^S7’1^^S7’2^ 



Since fs.r,m = messages received by the 

node t are the entries of the vector given by: 


J 

“1,1 ■ “l,l 


“l,l ■ “ri.l 

1 {i) 

• ^1,1 ' (^71,71 


4 (0 J K'f'} 

T ■ ®(fe-l)n+l,l ■ ■ ■ ■ ^{k-l)n+l, 


j Ai) 
“fed ■ 


(0 


j 

■ ^kn,n 


Xi 

X2 


(15) 


From Lemma rVl.21 the matrix family (D* g £J) ■ A is the 
possession matrix of the network after the round i. Thus, the 
matrix family (diag(ej) g I) ■ (£>* g • A is the possession 
matrix of the node j after the round i. The max-rank of this 
matrix family is the number of symbols the node j has after 
the completion of the i-th round of the protocol. 

To this end, the matrices as above satisfy the con¬ 
dition Coll, and the number of transmission in the protocol 
based on it is given by the right-hand side of the equality (|9ll. 
Therefore, in order to minimize the number of transmissions 
in the protocol, one has to choose the matrices that 
isfy ( [Tol l and minimize the right-hand side of the equality 


sat- 

( 0 . 


be the r-th row of . These vectors can be viewed as the 
linear coefficients multiplying the symbols transmitted by the 
node s during the i-th round of the protocol. In the Ath round 
of the protocol, the messages transmitted by the node s are 
given by the non-zero vectors in 


s 


E 

m€[n] 


f(.i) 

^s,r,m 


re[n] 



, (14) 


where x = {xi,X 2 , •' ‘ i Xn)'^■ 

The number of transmissions of the node s during the i-th 
round of the protocol is the rank of A^f^. When summing for 
all s G V, we obtain the number of transmissions r as stated 
in the right-hand side of Theorem IVI.4I with respect to this 

Observe that the node i receives all the messages from the 
node s if d(,^s f- 0. Therefore, the node i receives all the 
messages of the form 


• E 

mG [n] 



for all s G V, r G [n]. 
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